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Abstract 

For any branching process, we demonstrate that the typical total 
number r mp (z/r) of events triggered over all generations within any 
sufficiently large time window r exhibits, at criticality, a super-linear 
dependence r mp (i/r) ~ {vt)" 1 (with 7 > 1) on the total number vr of 
the immigrants arriving at the Poisson rate v. In branching processes 
in which immigrants (or sources) are characterized by fertilities dis- 
tributed according to an asymptotic power law tail with tail exponent 
1 < 7 ^ 2, the exponent of the super-linear law for r mp (i/T) is identical 
to the exponent 7 of the distribution of fertilities. For 7 > 2 and for 
standard branching processes without power law distribution of fertil- 
ities, r mp (ur) ~ (vt) 2 . This novel scaling law replaces and tames the 
divergence vt / (1 — n) of the mean total number Rt(r) of events, as the 
branching ratio (defined as the average number of triggered events of 
first generation per source) tends to 1. The derivation uses the formal- 
ism of generating probability functions. The corresponding prediction 
is confirmed by numerical calculations and an heuristic derivation en- 
lightens its underlying mechanism. We also show that Rt{ r ) is always 
linear in vt even at criticality (n = 1). Our results thus illustrate the 
fundamental difference between the mean total number, which is con- 
trolled by a few extremely rare realizations, and the typical behavior 
represented by r mp (vT). 
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1 Introduction 



Our goal is to investigate the statistics of offsprings in a general class of 
branching processes, including the standard versions as well as processes in 
which the immigrants (or sources) are characterized by fertilities distributed 
according to an asymptotic power law tail with tail exponent 7 > 1. Branch- 
ing processes of interest are characterized by the so-called branching ratio 
n, defined as the average number of triggered events of first generation per 
source. Three regimes occur: subcritical (n < 1), critical (n = 1) and su- 
percritical (n > 1). Here, we focus on the vicinity of the critical regime 
n = 1. 

To make the presentation concrete, we consider a general class of branch- 
ing processes, known as the Hawkes self-excited conditional Poisson pro- 
cess [IHI], which is the simplest Poisson process in which past events have 
the ability to trigger future events. Many works have been performed to char- 
acterize the statistical and dynamical properties of this class of models, with 
applications ranging from geophysical [5TfT2~]. medical JT3] to financial sys- 
tems, with applications to Value-at-Risk modeling [13] . high-frequency price 
processes [15], portfolio credit risks [IB] , cascades of corporate defaults [17] . 
financial contagion [IS], and yield curve dynamics [19]. 

The paper is organized as follows. In section 2, we define mathematically 
the Hawkes branching model and present briefly the formalism of generating 
probability functions (GPF) that is useful for our purpose. We also define 
the mean total number Rtfr) of events within a given time window (t, t + r) 
and its main properties. Section 3 recalls useful results based on GPF and 
gives the general expression for the probability p(r; vt) that the total number 
i?t( r ) of events within the time window (t, t + r) is equal to a given integer r 
(i? t (r) = r), as well as its asymptotic behaviors. Section 4 derives our main 
results by comparing the dependence of the mean versus the mode of the 
distribution p(r; vt) as a function of the average number vt of source events 
within a given window of duration r. Section 4 contains both a generalization 
of the derivation of the linear scaling for the mean total number and the 
derivation, both rigorous and intuitive, of the super-linear scaling of the 
typical total number of events. Numerical calculations are presented, which 
illustrate and confirm the analytical results. Section 5 concludes. 
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2 Definitions and formalism 



2.1 Definitions for the Hawkes self-excited conditional 
Poisson process 

In mathematical terms, the Hawkes model is specified via its conditional 
Poisson intensity (or rate) X(t\H t ,Q), defined as the limit for small time 
intervals A of the probability that an event occurs between t and t + A 
divided by A, given the whole past history H t , which is a function of the 
whole past according to 

\(t\H t ,e) = v(t)+J2 r Mt-ti) , (i) 

i\U<t 

where the symbol represents the set of parameters defined below. Here, 
u(t) represents the external background sources occurring according to a Pois- 
son process with intensity v(t), which may be a function of time. The history 
Ht = {^iji^ii, u <*<*i +i corresponds to all the timestamps of all events that 
occurred before the present time t. The sum in expression (CO) runs over 
all past events, spontaneous or triggered. Therefore, the Hawkes process is 
non-Markovian. However, it can be mapped exactly to a branching process, 
in the sense that each event can be tagged with its direct first-generation de- 
scendants, their first-generation descendants and so on. Aggregating all the 
Markov branches recovers the Hawkes process. It is important to realize that 
the mapping to a branching process is probabilistic, in the sense that a given 
genealogy has a probability weight and constitutes just one micro- realization 
equivalent to the whole history [20H23] . This mapping to a branching pro- 
cess can be rephrased as implying that each background event occurring at 
a rate u(t) triggers, independently of each other, its "aftershocks" of first 
generation, each of them triggering also independently of each other their 
own descendants and so on, according to a branching process statistics. 

Each event i that occurred at time £j has a fertility r*, defined as the 
expected number of events of first generation it can trigger. We assume that 
the distribution of these fertilities p±(r) is such that its statistical mean f 
exists. We refer to it as the branching ratio 

oo 

n = f = r • pi(r) , (2) 

r=0 

which is defined as the average number of triggered events of first generation 
per mother event. In the following, we will assume a specific power law tail 
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for p\(r), the probability that the total number of first generation events 
triggered by a single given source is equal to r, in the form 

^ r )^f(^)"^' ^t 1 ' 2 )' r>>1 - ^ 

The condition 7 > 1 ensures that the mean f = n is finite. The other 
bound 7 < 2 corresponds to the interesting regime where the variance of 
r is mathematically infinite, for which anomalous scaling occurs [21]. This 
regime 7 < 2 is also the relevant one to describe earthquake triggering [25] 
and has been recently documented in the distributions of commits and lines 
of codes per developer in open-source software projects [25]- We will also 
discuss briefly how our results extend to the case 7 > 2 and for pdf's pi(r) 
that are falling off at large r's faster than power laws, such as exponentials 
or even distributions with compact support. In expression ([3]), the scale 
parameter k obeys the inequality 

< 7K < n, (n ^ 1) , (4) 

which ensures that ^ Pi{0) ^ 1 and ^ ^ 1, as seen from expression 
® below. 

In the formulation of the Hawkes process (jTJ, the influence of each past 
event to the intensity \(t\H t ,Q) is mediated by the memory function or 
bare kernel h(t — which quantifies the time-dependence of the triggering 
potential of a previous event that occurred at time to produce a new 
event at time t. Specifically, r»/i(t — t») is the intensity (mean rate) of first 
generation offsprings (daughters) of a mother event % with average fertility 
Ti that occurred at time t^. It follows from the time-dependent branching 
process theory that the intensity Tih{t) is tied to the probability density 
function (pdf) f(t) of the random waiting times between the triggering event 
at time ti and its triggered events of first generation at all possible future 
times t > tf. h(t) = f(t). In other words, the parameterization chosen in 
expression (pQ) is such that h(t) is normalized: J °° h{t)dt = 1. 

Equivalent to the specification of the distribution p(r) of the random 
fertilities, we introduce the corresponding generating probability function 
(GPF) of the fertilities of first generation triggered events: 

00 

G 1 (z) = 5>i(rK ■ (5) 

r=0 

We assume the form 

G x {z) =l-n(l-z) + «(1 - zy + 0[(1 - z)% 7 e (1, 2) , (6) 
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where the expansion in omits terms of order (1 — z) 2 and higher. Using 
the form ([H]), we have pi(0) = 1 — n + k, = n — and 



whose asymptotic for large r is given by expression ([3]). 

In summary, the set of parameters involved in the Hawkes model, denoted 
by the symbol G, includes the branching ratio n that controls the average 
fertility of events, the parameters 7 and k of the pdf pi{r), the time scale 
and exponent of the memory function h(t). The term u{t) represents the 
external background sources occurring according to a Poisson process with 
intensity v{t), which may be a function of time. All other events are triggered 
by previous events, either background sources or previously triggered events 
of earlier generations. Each event can in turn trigger their own offsprings. 
This gives rise to the existence of potentially many generations of events, 
depending on the value of the branching ratio n. The regimes n < 1, n = 1 
and n > 1 are respectively called subcritical, critical and supercritical. 

2.2 Definitions of mean numbers of events 

The interpretation of the Hawkes process as the superposition of many 
independent branching processes has very useful consequences. Recall that 
the branching process representation means that one can reproduce exactly 
the same time series of events by considering that each first generation event 
triggered by some background source triggers independently its own first gen- 
eration aftershocks (which are therefore second generation aftershocks of the 
initial background source) and so on. In other words, any aftershock trig- 
gers independently its own branching process, which is statistically identical 
to the branching process generated by any background event occurring with 
rate u(t). 

Under these assumptions, the generating probability function (GPF) G(z) 
of the total number R over all generations of the events triggered by some 
background source satisfies the transcendent equation (see, for instance [21]): 



This expression embodies the sum of triggered events over all branches and 
over all generations issued from a single external source. 

In the language of GPF, the mean number of the first generation after- 
shocks, or branching ratio, is given by 




r > 1 , 



(7) 






2=1 



(9) 
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which is equivalent to definition 02]). Together with equation (jSJ), one can 
easily show that the mean number of triggered events from one source over 
all generations is equal to 



The divergence of the statistical average of the total number of triggered 
events as the branching ratio n approach the critical value 1 (one daughter 
per mother on average) is the symptom of the transition from the subcritical 
to the supercritical regime occurring at the critical value n c = 1 of the 
branching ratio. 

Let us now consider a time interval (t, t + r) and investigate the statistics 
of the number i?t(r) of events occurring within that window. In the following, 
we assume that the rate v of exogenous background source events is constant. 
Then, the mean value of the number R sourC c( T ) of background sources within 
that window is equal to the rate v of its generating Poisson point process 
multiplied by the window duration r: -Rsource('r) = u ■ r. Since each source 
event triggers independently its own aftershocks, with mean number given 
by expression (fTOj) . we thus obtain that the mean rate z/ a n including sources 
and their aftershocks over all generations is equal to 

n v 

^all = V + ^aftershocks — U + U ■ = . (11) 

I — n 1 — n 

Accordingly, the mean number Rt(r) of all events occurring within (t, t + r) 
is equal to 



Notwithstanding the fact that Rtfr) is infinite at n — 1, the process is still 
stationary, if the rate of arrival of the external sources is stationary. And it 
makes sense to discuss typical realizations with bounded numbers of events 
in any finite time window, as we show below. Indeed, the divergence of -R*(t) 
is controlled by rare realizations weighed by a heavy power law tail such that 
its mean does not exist (exponent less than 1). 

3 Statistics of windowed event numbers 

3.1 Large window approximation and corresponding 



We investigate the statistical properties of the number of events occurring 
within an arbitrary time interval (t,t + r), in the limit where the window 




(10) 




(12) 



GPF 
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is sufficiently large, or the memory kernel h(t) is sufficiently short-range, 
such that the main contribution to the number of events in (t, t + r) comes 
from the source events and their aftershocks that occur within that window. 
Under these conditions, without essential error, one can approximate the 
number of events occurring within (t, t + r) that are triggered by each source 
in that window by their total number of triggered aftershocks. Note that the 
condition that r should be sufficiently large, or h(t) should be sufficiently 
short-range, does not necessarily mean that the number of events within 
(t, t + r) is large. Indeed, if v is sufficiently small, then the mean value given 
by ffl2]) may also be small, even in the near critical case n < 1 and within 
the large window approximation. We have previously extensively studied the 
large window approximation and refer to the corresponding articles [T^,[27] 



In mathematical language, this translates as follows. Consider the GPF 
©noise (z) of the number of source events within (t, t + r). Using the fact that 
these background source events follow a Poissonian statistics, we have 

e noisc (z;r) = e-^ 1 ^ . (13) 

Using the replacement 

z h-> zG{z) (14) 

in the r.h.s. of expression ({TBI , where G(z) is the GPF of the total number 
of events triggered by some source event given by equation (fIU|) . we get the 
expression for the GPF of the total number of events of all generations in 
(t, t + r) triggered by all sources before and within (t, t + r): 

Q( Z]T ) = e -^ l - zG ^ . (15) 

3.2 Mathematical expression of the distribution p(r; tjt) 
of event numbers 

Let us denote by p(r; z/r) the probability that the total number -R*(r) 
of events within the time window (t, t + r) is equal to a given integer r 
(i?t(r) = r). We have previously derived the following expression [21] 



p(r;ur) = ^^[G[(z).e^] 



r > , (16) 

z=0 



which holds in our case within the large window approximation. Expression 
ffTB"]) can be reformulated as the following Cauchy integral 



p( r; vr ) = ¥L I G\ (e^) exp [vt (e** - l) + z(l - r)</>] d<p . (17) 
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3.3 Asymptotic behavior of p(r; vt) 

For vt ^> 1, the main contribution to the Cauchy integral (TPT1) is provided 
by values of close to 0. For small 0's, one can expand ~ 1+20 and extend 
the two limits of the Cauchy integral to ±oo. This yields the asymptotic 
relation 



p(r; vt)- — / GUI + %4>y- l(T - VT - x ^d4> . (18) 
Expanding Gi(z) given by expression (JHD for small reads 

+ = (1 + in0 + /<-i0) 7 ) r ~ e «»r<i>+»r{-iW ^ r >l. (19) 

Substituting this expression in the integral in the r.h.s of expression (ITS]) , we 
obtain 

,, r poo 

p(j- vr )~ / e ^(-^)^-i[r(l-n)-^-l]^ _ ( 2 Q) 



2irr 



oo 



We introduce the change of integration variable (/tr) 1 / 7 = u, which 
transforms relation (l2Up into 



^^^^1/"'"^"' (21) 

with the following definition of the auxiliary variable 

x = r(1 ~f ~J T - 1 . (22) 

Expression f[2"Tj) has the form of a stable distribution. Defining the standard 
stable distribution / 7 (x) by its Fourier transform 

POO 

/»:= / f,(x)e lux dx = e^ 1 , (23) 



oo 



'7V / • / ^7 

-oo 



relation (1211) transforms into 



/ \ VT ? t \ VT p f r(l — n) — vt — 1\ . s 

P(r; -) - ^^/7(^) = ^77/7 ^ ^ ) ■ (24) 

For i/r ^> 1, the "exact" expression (|17[) and its asymptotic approxi- 
mation for the probability p(r; vt) are close to each other, including 
their shapes around their modes (value of r for which which p(r; vt) attains 
its maximum), which practically coincide. For comparison, figure [1] plots 
p(r; vt) given by the two formulas flTjl and (121]) as a function of r. 
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4 Mean versus mode of the distribution p(r] vr) 
of event numbers 



4.1 Linear scaling of the event number mean Rt(r) as 
a function of the source rate v 

In a stationary regime where the rate u(t) = v of background sources is 
constant, expression f 1 1 2 p shows that the mean number R±{t) of all events 
occurring with (t, t + r) is simply proportional to v and diverges as the 
branching ratio approaches its critical value n c = 1. 

We now show rigorously that the linear dependence of Rt{T~) as a function 
of v holds true for any r's, even when u(t) is varying with time as for instance 
when there are no sources for t < and the process starts at t — 0. In that 
case, expression ffT5]) for the GPF Q(z; r) of the total number of events of all 
generations in (t, t + r) triggered by all sources before and within (t, t + r) 
generalizes into 

where G(z, t) is the GPF that generalizes G(z) given by formula OH]) to ac- 
count for the finite life of the process and the finiteness of the time window 
(t, t + r). From the definition of Q(z; r), the following identity holds 

dO(z;r) 



Mr) = 



(26) 



dr 

It follows from expression (125]) that 

R t ( T ) =ut + u [ S(t, n)dt , (27) 



o 



where 



dz 



(2f 



=i 

is the mean number of events of all generations triggered by some source, 
which is counted from the time of the occurrence of the source till time t. 
Therefore, by its definition, S(r, n) does not depend on the rate v of sources, 
which implies that the mean number Rtij) of all events occurring within 
(t, t + t) is linear in v. In the stationarity case of a constant u, this linear 
dependence is expressed by relation f[T2T) . 

The divergence of Rtfr) at n — > n c = 1 is retrieved from a different 
perspective. Expression fl2l]) shows that, for 1 < 7 < 2, 

p(r, ut) ~ — \ for r ^> 1 and n = 1 . (29) 
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Since 1 < 7 < 2, then 1/2 < I/7 < 1, which implies that Rt{r) is infinite for 
n = 1 since it is the first-order moment of the distribution p(r, vt), which has 
a tail exponent I/7 smaller than 1. This retrieves the divergence of relation 
([12]) at n = 1. This divergence of the mean number Ri{t) of all events 
occurring with (t, t+r) is thus controlled by rare realizations with exceedingly 
large numbers r of events. However, these rare realizations do not represent 
the typical situation of a given system or of a given simulation governed by 
the Hawkes self-excited process. To obtain a better understanding of the 
critical regime n — 1 and of its neighborhood (n ~ 1), we need to study 
other moments or quantiles of the distribution p(r, vt) and how they behave 
as a function v and r. 

For 7 > 2 or for pdf's pi(r) falling off at large r's faster than a power 
law, expression f )2"$|) is replaced by 



p(r, vt) ~ —j , (30) 



1+- 

T '2 



which corresponds to the mean field distribution of cluster sizes in standard 
branching processes [29] . 

4.2 Super-linear scaling of the event number mode as 
a function of the source rate v 

The principal dependence as a function of vt of the mode r mp and of 
the g-quantiles r q of p(r; vt) is obtained by remarking that the slow power 
factor r~ 1-1 ' 7 in the r.h.s. of the asymptotic relation f |24|) does not influence 
significantly their determination. Let us call i] mp (j) and 77^(7) respectively 
the mode and the g-quantiles of the stable distribution / 7 (x). Figure |2] shows 
the dependence of ?7mp(7) as a function of 7 in the interval 1 ^ 7 ^ 2 of 
interest. Then, the mode and quantiles of p(r; ut) are given by 

(1 - n)r mp + 7] mp (y) [nr mp ] lh = vr + 1 , (31) 

and 

(l_ n ) rg + r/9 ( 7 )[ Krg ] 1 /7 = z/r + 1 ? (32) 

From these two equations, it is obvious that the dependences of r mp and of 
r q as a function of vr are identical up to a numerical constant. We therefore 
only discuss the dependence of r mp . Equation fl3"TT) yields 



r mp {VT) 



1 / VT 



^ \ 'Imp 
VT 



VT < 



VT > 

n 




(33) 
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1 

The second asymptotics for vt ^> ( j 7 1 coincides with the linear 

dependence of the mean number -R<(r) of events given by expression (1121) . 
This shows that, when n is not too close to 1, the mean number Rtir) and 
mode are identical asymptotically for large vt. 

The novel insight is provided by the first asymptotics r mp ~ (^t) 7 , which 
replaces the divergence as n — > 1 by a super-linear dependence with exponent 
1< 7 ^2. 

The two regimes in ( 133|) can be interpreted within the general frame- 
work of bifurcations or phase transitions, where the control parameter is the 
branching ratio n, the order parameter is the mode r mp of the total number 
of events and the external driving "field" is the total number vt of sources. 
Then, we have 



5 

i7 



r ^ = — 1 -" >> (^' (34) 

which expresses the standard linear relationship between order parameter r mp 
and external field vt, with the proportionality factor 1/(1 — n) corresponding 
to the susceptibility that diverges as the critical point n c = 1 is approached, 
with the standard mean field value exponent 1. At criticality or very close 
to it, expression (134)) is replaced by 



r mp -^-M 7 , l-n«^ r , (35) 



which is analogous to the nonlinear dependence of the order parameter on 
the external field at criticality, in the theory of critical phenomena [30J. 

These two asymptotic dependences (13"3"|) of r mp (z/r) can be checked by 
direct numerical calculations based on expression ()17p for p(r; vt) obtained 
with the large window approximation. Figure |3] shows the dependence of 
the mode r mp (z/r) as a function of vt for n = 0.99, k = 0.25 and 7 = 1.5. 
The two predicted super-linear and linear regimes fl3"3"|) are confirmed, while 
the amplitude of the super-linear regime has to be corrected according to 
r(z/r) ~ 1.6 • (vt) 1,5 . Figure H] is the same as figure |5] but for 7 = 1.75 for 
which the difference between the super-linear and linear regimes are even 
more striking. Figure |5] shows t{vt) for 7 = 1.75, k = 0.25 and for different 
values of the branching ratio n. The closer n is to its critical value 1, the 
larger the range of vt over which the super-linear regime t(vt) ~ (^t) 7 
holds. 
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4.3 Heuristic derivation of the super-linear scaling r mp (z/r) 

We now provide a simple intuitive derivation of the main result of this 
article that the typical dependence of the total number -Rt(r) of events within 
the time window (t, t + r) as a function of vt is super-linear (~ {vt) 1 with 
1 < 7 ^ 2) when the branching ratio 1 is close to or equal to 1. 

Recall first expression ff29l) describing the tail at large r of the probability 
p{r; vt) valid for n — 1. Actually, the same power law tail with exponent 
I/7 describes the distribution of the total number of events of all generations 
generated by a given background source, as first derived in Ref. [23] . As 
previously mentioned, the value of the exponent I/7 < 1 implies that the 
average number of events of all generations per source diverges. Along the 
lines of reasoning developed in Refs. [MJE2], such mathematical divergences 
valid for an infinite system translates into abnormal scaling laws for finite 
systems, here finite window sizes r and finite total number vt of sources. 

The reasoning goes as follows. Within a given window of duration r, there 
are typically vt source events, for a Poisson arrival rate v of the sources. Each 
source % of these vt source events produces a random number r $ of events of all 
generations. These random numbers {r 1; r 2 , r UT -\, r UT } associated with the 
vt sources are distributed according to p{r) ~ 1/r 1+ ~ at criticality n = 1 |24J . 
Let us call r max (z/r), the largest among this set {7*1, r2, r VT -i, r VT }. A 
good estimate of r max (z/r) is obtained by the condition that the probability 
r +oc ', > p(r)dr to find a source with a total number of triggered events equal 
to or larger than r max (z/r) times the total number vr of sources is equal to 
1. In other words, by the definition of r max (z/r), there should be only one 
source with such a total number of triggered events. This yields 

r ma x(H ~ (>t) 7 . (36) 

An estimate of the typical total number of triggered events r\ + r 2 + ... + 
r ur~i + r ur can be obtained as 

f\+r 2 + ■■■ + r VT _i+r UT w {vt) \ rp(r)dr ~ (^t) 7 , for 7 > 1 . (37) 

Jo 

Note the upper bound in the integral in estimation (|37|) of the total number 
r\+r 2 + ■■■+r UT ^i+r VT of triggered events by all the vt sources, which reflects 
the fact that the random variables {ri,r 2 , ...,r UT _i,r UT } are not larger than 
r mai (i/T) by definition of the later. In words, equation (157)1 states that an 
estimate of the typical value of the total number of all triggered events by all 
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sources in the time interval [t,t + r] is proportional to (z/r) 7 , which recovers 
the first scaling regime of expression (|33p . 

Using expression (l30i) and following the same line of reasoning, the scaling 
relationship (13"?]) is replaced by 

n + r 2 + ... + r VT -i + r UT ~ (z/r) 2 , (38) 

for 7 > 2 or for pdf's pi(r) falling off at large r's faster than a power law. 
This is confirmed by considering the exact derivation for the classical case of 
Poissonian statistics of first generation triggered events, for which the GPF 
of the fertilities of first generation triggered events defined by (pxj) reduces 
to Gi(z) = e n ^ z ~ l \ The corresponding exact expression for the probability 
p(r; z/r) defined by equation f|T6l) is 

p(r; VT ) = V -L. ( nr + vt) v ~ X • e""™ . (39) 
t\ 

Figure [6] shows the dependence of the mode r mp of p(r; z/r) given by (139]) as 
a function of z/r for n = 0.99, 

5 Concluding remarks 

Considering the Hawkes self-excited conditional Poisson processes, we 
have discovered a novel super-linear scaling relating the typical number r mp (z/r) 
of triggered events by background sources (or "immigrants" to employ the 
terminology of branching processes) to the total number z/r of these sources, 
valid when the branching ratio is critical (n — > 1). This novel scaling law 
r mp (yr) ~ ( Z7r ) 7 for 1 < 7 < 2 and r mp (z/r) ~ (^t) 2 for 7 > 2 replaces 
and tames the divergence z/r/(l — n) as n — > 1 of the mean total number 
of triggered events. The occurrence at criticality n = 1 of an infinite num- 
ber of generations of events, in which sources trigger daughter events, these 
daughters trigger their own daughters and so on, is the mechanism for the 
renormalization into a power law distribution of the total number of triggered 
events over all generations per source with exponent I/7 < 1 for 1 < 7 < 2 
and with exponent 1/2 for 7 > 2 [21], where 7 is the exponent of the power 
law distribution of triggered events of first generation per source. In absence 
of such a power law of first generation daughter per mothers, the distribution 
over all generations of events generated by a single source takes the standard 
mean field form of a power law tail with exponent 1/2, as given by expression 
( 1HU]) . Then, in the presence of this heavy tail distribution of renormalized 
fertilities over all generations of the sources at criticality, the typical total 
number of triggered events is dominated by the few most fertile sources in a 
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large deviation regime, which is responsible for the super-linear regime. We 
have derive this result rigorously using the formalism of generating probabil- 
ity functions. The corresponding prediction has been confirmed by numerical 
calculations. The essence of the mechanism for the super-linear law has also 
been dissected with an heuristic derivation. 

We also stress that the super-linear scale of the mode of the total number 
of events over all generations within any sufficiently large time window also 
holds for quantiles of the distribution p(r; vr) of the total number Rt(r) of 
events within the time window (t, t + r) that are not too far in the tails. In 
contrast, we have shown that the mean of i^t(r) is always linear in vr even at 
criticality. Our results thus illustrate the fundamental difference between the 
mean total number, which is controlled by a few extremely rare realizations, 
and the typical behavior represented by r mp (z/r). 

We will report elsewhere empirical evidence of the super-linear regime in 
open-source software development projects [26] . 
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Figure 1: Plots of the probability p(r;i>r) that the total number Rt(r) of 
events within the time window (t, t + r) is equal to a given integer r, given 
by the "exact" expression (JTTJ) and by the asymptotic approximation f[24|h 
for vr = 100, n = 0.99, 7 = 1.5, and n = 0.25. One can observe that the two 
curves differ insignificantly. 
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Figure 2: Dependence of the mode ?7 mp (7) of the stable distribution / 7 (x) as 
a function of 7. 
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Figure 3: Dependence of the mode r mp (i/r) as a function of ut for n = 0.99, 
k = 0.25 and 7 = 1.5, obtained by numerical calculation of the integral in 
expression (TTTj) . The dashed lines are the asymptotics (13"3"I) . While the super- 
linear dependence is correctly predicted, the amplitude is over estimated. 
The dotted line shows the same super-linear dependence as the dashed line 
on the left but with a corrected amplitude: r mp (z/r) ~ 1.6 • (z/t) L5 . 
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Figure 5: Dependence of the mode r mp (z/r) as a function of vt for k = 0.25 
and 7 = 1.75 and different values of the branching ratio n (top to bottom: 
n = 0.995; 0.99; 0.975; 0.95; 0.9). These curves are obtained by numerical 
calculations of the integral in expression (fl7|) . 
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Figure 6: Dependence of the mode r mp as a function of vt for n = 0.99, ob- 
tained from the exact expression (I59"|) . corresponding to the classical Hawkes 
process for which G\(z) = e n<yZ ~ l \ The two dotted lines show respectively 
the super-linear r mp (z/r) ~ (^t) 2 and linear r ~ 100 • vt asymptotics. 



20 



